Engineers, Aware! Commercial Tools Disagree on Social Media Sentiment: Analyzing the Sentiment Bias of Four Major Tools
نویسندگان
چکیده
Large commercial sentiment analysis tools are often deployed in software engineering due to their ease of use. However, it is not known how accurate these are, and whether the ratings given by one tool agree with those another tool. We use two datasets - (1) NEWS consisting 5,880 news stories 60K comments from four social media platforms: Twitter, Instagram, YouTube, Facebook; (2) IMDB 7,500 positive negative movie reviews investigate agreement bias widely used (SA) tools: Microsoft Azure (MS), IBM Watson, Google Cloud, Amazon Web Services (AWS). find that assign same on less than half (48.1%) analyzed content. also AWS exhibits neutrality both datasets, bi-polarity dataset but dataset, MS exhibit no clear have dataset. Overall, has highest accuracy relative ground truth Findings indicate psycholinguistic features especially affect, tone, adjectives explain why disagree. Engineers urged caution when implementing SA for applications, as selection affects obtained labels.
منابع مشابه
Challenges of Evaluating Sentiment Analysis Tools on Social Media
This paper discusses the challenges in carrying out fair comparative evaluations of sentiment analysis systems. Firstly, these are due to differences in corpus annotation guidelines and sentiment class distribution. Secondly, different systems often make different assumptions about how to interpret certain statements, e.g. tweets with URLs. In order to study the impact of these on evaluation re...
متن کاملPotential and Limitations of Commercial Sentiment Detection Tools
In this paper, we analyze the quality of several commercial tools for sentiment detection. All tools are tested on nearly 30,000 short texts from various sources, such as tweets, news, reviews etc. In addition to the quality analysis (measured by various metrics), we also investigate the effect of increasing text length on the performance. Finally, we show that combining all tools using machine...
متن کاملMeta-Classifiers Easily Improve Commercial Sentiment Detection Tools
In this paper, we analyze the quality of several commercial tools for sentiment detection. All tools are tested on nearly 30,000 short texts from various sources, such as tweets, news, reviews etc. The best commercial tools have average accuracy of 60%. We then apply machine learning techniques (Random Forests) to combine all tools, and show that this results in a meta-classifier that improves ...
متن کاملBenchmarking Twitter Sentiment Analysis Tools
Twitter has become one of the quintessential social media platforms for user-generated content. Researchers and industry practitioners are increasingly interested in Twitter sentiments. Consequently, an array of commercial and freely available Twitter sentiment analysis tools have emerged, though it remains unclear how well these tools really work. This study presents the findings of a detailed...
متن کاملA Sentiment-Aware Approach to Community Formation in Social Media
Participating in a community exemplifies the aspect of sharing, networking and interacting in a social media system. There has been extensive work on characterising on-line communities by their contents and tags using topic modelling tools. However, the role of sentiment and mood has not been studied. Arguably, mood is an integral feature of a text, and becomes more significant in the context o...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ACM on human-computer interaction
سال: 2022
ISSN: ['2573-0142']
DOI: https://doi.org/10.1145/3532203